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Abstract 

We develop a fully Bayesian, computationally efficient framework for incorporating 
model uncertainty into Type II Tobit models and apply this to the investigation of the 
determinants of Foreign Direct Investment (FDI). While direct evaluation of model 
probabilities is intractable in this setting, we show that by using conditional Bayes 
Factors, which nest model moves inside a Gibbs sampler, we are able to incorporate 
model uncertainty in a straight-forward fashion. We conclude with a study of global 
FDI flows between 1988-2000. 
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1 Introduction 



We consider modeling the determinants of Foreign Direct Investment (FDI) and incorporat- 
ing model uncertainty in this framework. FDI flows are important for several reasons. For 
one, they can be taken as a measure of growing economic globalization. Every time a firm 
sets up a new subsidiary in a different country they engage in FDI. And every positive FDI 
flow increases the FDI stock in a certain country. The size of the FDI stock shows how much 
this country is integrated into the international market, because FDI usually comes with a 



considerable amount of trade as one third of world trade is intrafirm (Navaretti and Ven- 



ables, 2006). Also, FDI may have a positive effect on the host country's economy. Countries 



may hope for increasing employment and other positive effects like technological spill-over, 
educational effects on the labor force and of course increasing tax income. While tradition- 
ally most FDI occurs between the developed countries, developing countries have growing 
FDI inflows and this trend is not expected to change. 

We consider bilateral FDI inflows from the years 1988-2000 and model the decision to 
engage in foreign direct investment which includes the decisions of how much to invest and 
whether to invest at all. The two-stage nature of FDI decisions disqualifies the use of simple 
linear regression and requires a statistical approach that accounts for sample selection. One 
framework that may be used is the generalized Tobit model, a two-equation linear regression 
model. 

An additional statistical complication arises from the complexity of the FDI decision 
process. This leads to a profusion of economic theories and concomitant model uncertainty 
which we address through Bayesian Model Averaging (BMA). The specifics of Tobit estima- 
tion in a Bayesian context make direct model comparison difficult, due to the inability to 



resolve nested sets of integrals. In a pervious study (Eicher et al. , 2011 ) of FDI determinants, 
this issue was addressed through the use of BIC approximations in a framework based on 
Heckit estimation. While the method was useful at illuminating important FDI determi- 
nants, it stood on tenuous ground theoretically. The variable inclusion probabilities were 
approximations and the entire modeling framework was not strictly Bayesian. We resolve 
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these issues in the developments below. 

We show that by nesting model moves inside a larger Gibbs sampler, we are able to com- 
pute conditional Bayes Factors (CBFs) directly and easily This leads to a procedure that 
simultaneously addresses selection bias and model uncertainty and does so with limited addi- 



tional computational complications. CBFs were originally introduced by Dickey and Gunel 



(1978) and recently proposed in a related econometric setting by Karl and Lenkoski (2012). 



Our new method, which we call Tobit Bayesian Model Averaging (TBMA), thus combines 
CBFs with Markov Chain Monte Carlo (MCMC) estimation to yield a fully theoretically 
valid methodology that is also computationally much more efficient than the procedure de- 



veloped in Eicher et al. (2011) 



We conclude with an analysis of the data provided in Eicher et al. (2011). We show 



that our new method yields broadly similar conclusions as that of Eicher et al. (2011), in 



the determinants of the amount of FDI. However, the two methods differ substantially in 



the selection equation. This is attributed to the fact that the HeckitBMA method of Eicher 



et al. (2011) did not "feed back" information from the outcome equation into the model 



probabilities of the selection equation. Our method, by contrast, updates the two model 
spaces in a joint fashion, which is more grounded in Bayesian theory. Futher, the method 



requires only a fraction of the computing time taken by Eicher et al. (2011), due to the 
efficiency of the CBF calculations. 

The article is organized as follows. Section [2] will review the Type II Tobit model and 
discuss the Bayesian estimation of its parameters. Section [3] will build on these develop- 
ments to introduce the TBMA algorithm that incorporates model uncertainty. Section [4] 
will comprise our main data analysis of FDI determinants. In Section [5] we conclude. 

2 Review of the Tobit model 



FDI flow data contain a large number of 0's, resulting from an issue of sample selection 



(Eicher et al. , 2011). In order to avoid selection bias we use the generalized Tobit model, 



also known as Type II Tobit model or sample selection model. The bias in the OLS estimate 
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arises when manually selecting observations (i.e. only use observations with a positive FDI 
flow) . This problem can be solved by using a system of two equations with a selection and an 
outcome equation thus putting the selection of observations into a probabilistic framework. 
The generalized Tobit model for i e {1, 2, n} is given by 

Zi = wfi + a (l) 

y* = xfi + th (2) 
yi = y*l(z t >0). 

Above, ?/j is the observed dependent variable and y* its uncensored factor. z\ is a latent 
variable that determines whether is observed and Wi, X{ are independent p x 1 and q x 1 
variable vectors with corresponding coefficient vectors 0, (3. Relating to the modeling of 
foreign direct investment, yi is the log FDI flow from one country to another (only one 
direction) and cCj contains the covariates which are believed to influence the magnitude of 
FDI flows. The vector Wi consists of the covariates that influence the initial decision whether 
to engage in FDI. 

It is reasonable to assume that these two vectors share some covariates. Corruption, for 
example, could deter firms because of ethical reasons or fear of legal punishment, while the 
higher cost of doing business could decrease the amount of investment. But the covariate 
vectors could also be entirely different. 

We assume correlation between the selection and outcome process and therefore the error 
term is jointly distributed (e*, rji)' ~u.d. A/"(0, S) with covariance matrix 

1 7 
7 + 7 2 

The entry En can be set to 1 for identification because z t is an unobserved latent variable 



(3) 



and only its sign is of relevance (McCulloch et al. 2000) 



2.1 Bayesian Posterior Determination 

In order to determine the posterior distribution of the parameters in 0-((3]) we follow the 



Gibbs sampler outlined in Omori (2007). This returns a dependent sample whose distribu- 



tion resembles that of 0, (3, 7, (f)\z, y where z = (zi, z n ) and y = (yi, y n )' ■ 

As z is a latent variable and cannot be observed, it will be generated using data aug- 



mentation (Albert and Chib, 1993). Early Bayesian approaches to Tobit estimation also 



imputed the censored observations in y. This however is inefficient (Chib, 2007) and Omori 



(2007) outlines a Gibbs sampler based on the marginalization of the posterior distribution 



over censored observations. 

Let y Q denote the vector of uncensored observations. We define the following prior dis- 
tributions 



0~A/"(0 o ,©o) P~N{P Q ,B Q ) 7~A/"( 7 o,G c 



XS(f,f) 



(4) 



where XQ denotes the inverse gamma distribution. This yields the following joint posterior 
probability density 

pr(z,0,/3,7,0|y o ) oc pr(z, y Q \d, (3, 7, 4>) pr(0) pr(/3) prfr) pr(0), 

where 



W(z,y o \0,f3,j, 



oc <f) 



-n„/2 



cxp j 



E (* - 



i|^i<0 



+ E C 1 + ? )(* - ™^) 2 - - - ^) + ?(«*- ^ 

j|z,;>0 

and n is the number of uncensored observations. The conditional posterior distributions of 
ift — (0 ,/3 )', 7 and are then given by 

il>\z,y ,j,((>~ AT(^i,*i) 

^,0~AT(7i,G 1 ) (5) 
0|«,l/o,V,7~^(^,f). 

A detailed description of the parameters is given in Appendix A. 

The conditional distributions used to sample the latent variables Zj have the form of 
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normal distributions truncated at zero. A censored observation corresponds to a negative 
value of Zi, whereas for an uncensored observation to occur it must be positive. Due to 
marginalization over the censored observations, the way in which mean and variance are 
calculated depends on whether the observation is censored (see Appendix A). 

3 Incorporating Model Uncertainty 

In the case of FDI, model uncertainty means uncertainty about which FDI determinants have 
an influence on the actual FDI flows and thus which economic theories can be supported. 
The two equations in the Tobit model represent different types of influence. Inclusion of a 
determinant in the selection equation means it has an influence on the decision whether to 
invest at all, while inclusion in the outcome equation suggests influence on the magnitude of 
the actual FDI flow. 

In order to explain the motivation behind our CBF approach, we first review some basic 
results from classic BMA literature. We then show how the concept of Bayes Factors can 
be usefully embedded in a Gibbs sampler yielding CBFs. These CBFs are then shown to 
yield straightforward calculations. The section concludes with an overview of the full TBMA 
procedure. 

3.1 Bayes Factors 

In a general framework, incorporating model uncertainty involves considering a collection of 
candidate models X, using the data V. Each model I consists of a collection of probability 
distributions for the data T>, {pr(2}|£),£ G S/} where Sj denotes the parameter space for 
the parameters of model / and is a subset of the full parameter space S. 

By letting the model become an additional parameter to be assessed in the posterior, we 
aim to calculate the posterior model probabilities given the data V. By Bayes' rule 

fr\Tf\ pr(V\I)pr(I) 
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where pr(I), denotes the prior probability for model I EX. 
The integrated likelihood pr(V\I), is defined by 

pr(V\I) = f pr(V\Opr(Z\I)dt, (7) 

where pr(£\I) is the prior for £ under model /, which by definition has all its mass on Hj. 
One possibility for pairwise comparison of models is offered by the Bayes factor (BF), 



which is in most cases defined together with the posterior odds (Kass and Raftery, 1995). 
The posterior odds of model / versus model I' is given by 

pr(I[D) pr(T>\I) pr(I) 
pr{I'\V) ~ pr{V\I') pr{ry 

where 

pr(V\I) pr(I) 
pr(V\r) an pr(I') 

denote the Bayes factor and the prior odds of I versus I', respectively 

When the integrated likelihood ([7]) and thus the BF can be computed directly, a straight- 
forward method for exploring the model space, Markov Chain Monte Carlo Model Compo- 



sition (MC3), was developed by Madigan and York (1995). 



MC3 determines posterior model probabilities by generating a stochastic process that 
moves through the model space X and has equilibrium distribution pr{I\D). Given the cur- 
rent state jw, MC3 proposes a new model I' according to a proposal distribution q(-\-), 
calculates 

pr{V\I')pr(r)q(I^\I') 

a — 



pr(V\I( s ))pr{I( s ))q{P\I^) 
and sets = I' with probability min{a, 1} otherwise setting I^ s+1 ^ = jw. 

3.2 Conditional Bayes Factors 

For the model space of the Tobit model we will be using the Cartesian product of two model 
spaces, one for each equation. Let X be the model space for the selection equation and C 
the model space for the outcome equation. Each model G X and Lj e C defines which 
covariates shall be used in their respective equation and the entirety of models M. = X x C 



covers all possible combinations of covariates for both equations. A model M G M. can 
be interpreted as a specific selection of restrictions ^ m on the vector ip. The entry in the 
parameter vector will be set to if the corresponding covariate should not be included in 
the estimation according to the model in question. 

As discussed above, two competing models could be compared by computing 

pr(V\M) = / / pr(V\ilj,T 1 )pr(ilj\M)pr(T l )di!dE 

Unfortunately, this factor, has no obvious, analytic form. Instead, we focus on the conditional 
Bayes Factor (CBF), 

pr(D|M y ,£) = J prp|M y ,V,S) pr^JWy, £)#, 

The CBF thus calculates the integrated likelihood conditional on a fixed setting of S. In 
appendix B we show that 

|*i,m| 1/2 exp(-^o,A/*o,L^o,M' 



pr(P|M,E) oc 



l*o,Af| 1/2 exp(-|V'i,M*iiV'i 1 M) 
where i[>o,m, V'i.m, ^o,m an d *&i,m are exactly analogous to the paramters discussed in Sec- 
tion [2] but restricted to the subset of variables contained in model M. We see that the 
expression ^ is quite straightforward to calculate and essentially only requires the param- 
eters necessary in the Gibbs sampler. 

We feel a comment on the implication of equation ^ is in order. Remember that for 
linear regression BIC is calculated as 

BIC M = n log(l - R 2 M ) + p M log n 

where n is the sample size, R 2 M is the value of R 2 for the model M and pu is the number of 



parameters included (Raftery, 1995). Assuming a flat prior distribution on the model space 



we can use BIC to approximate the posterior model probability 

pr(M\D) « exp(-§ BIC M ) « ^exp(f^) 

where c is a normalizing constant. The last part of the equation can be interpreted as the 
product of a term penalizing for model complexity and a term measuring the goodness of fit. 




We note that (J8J) can be interpreted in the same manner. The second part of the product in 



where y n is a two-dimensional vector of the latent and dependent variable and X n is a 
2 x (p + q) matrix of covariates (see Appendix A for the definitions). Essentially, this is the 
residual sum of squares for the decorelated and normalized equations of the Tobit model as 
the argument of a strictly decreasing function. A decrease in the residual sum of squares, 
or increase in goodness of fit, leads to a greater integrated likelihood. The ratio containing 
the prior and posterior covariance matrix penalizes for increasing model complexity. We 
therefore see that ^ has a strong relationship to the calculation of BIC, and yet relies on 
no approximations. 

3.3 Tobit Bayesian Model Averaging (TBMA) 

We now review the entire TBMA algorithm that integrates MC3 into the Gibbs sampler using 
CBFs. We assume a flat prior on the model space (i.e. pr(M) oc 1), although other prior 
distributions could be readily accommodated. For a detailed description of the parameters 
see Appendix A. 

1. Initialize if), 7 and (f>. 

2. Sample z\y a , if), 7, 0. 

• For censored observations: Zi\M, if), 7, ~ .A/(_oo ) o)('U^0, !)• 



+ (fin - x n ^)'jr\y n - x n ij> ltij ) 



n\z n >0 



+ (</>« 
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• For uncensored observations: z^i/i, M, if), 7, ~ A/]o,oo)(a^, of) 

3. Sample 7|M,z,?/ o ,V,0 ~ A/"(7i,Gi). 

4. Sample <f>\M,z,y a , ^,7 ~^(f,f)- 

5. Apply modified MC3. 

(a) Sample M' from the neighborhood of M. 

(b) Compute pr(P|M', 7, 0) and pr(D|M, 7,0) as in equation 

(c) Accept M' as new M with probability min {1, CBFm'm}- 

6. Sample ^|M, 2, ?/ , 7,0 ~AT(^i,*i)- 

7. Go to 2. 

By adding one single step to the existing Gibbs sampler we are able to include model un- 
certainty. Furthermore, it is a very efficient way since the calculations for the parameters of 
the conditional posterior distribution of if) are included in step 5. 

4 Application to FDI Data 

4.1 Economic theories and associated FDI determinants 

Every investor considering FDI is confronted with a two-stage decision. Whether he should 
make an investment abroad and if so, how much he should invest. Even though later on we 
will be looking at aggregate FDI flows derived from the balances of payments of different 
countries, these decisions are made at the firm level. Naturally, firm-level data would be the 
best basis for inference but they are almost impossible to come by. 

Probably the most important principle in the theoretic work on FDI is the OLI frame- 



work by Dunning (1980). The OLI principle states that firms decide to engage in FDI if they 
have ownership of goods or production processes that give them market power, if they can 
benefit from having a plant in a foreign location rather than at home and if internalization 
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of the foreign activity provides an advantage over contracts with local firms. Ownership 
and location-specific advantages are reasons why firms would want to produce in a foreign 
country in the first place, and internalization is a way to protect your assets in face of market 
failures. Every time assets are revealed there is a risk of dissipation which would result in a 
loss of value. This concern may lead to difficulties in negotiations for rents of a certain asset. 
Firms would not want to reveal the full asset before a contract is finalized while the potential 
local partner is not ready to pay the full price until the asset has been fully revealed. Even 
after a contract is finalized, there is the risk of hold-up situations when the local partner had 
to make investments in relationship-specific goods. Due to incomplete contracts, there is a 
possibility of renegotiations where the local partner has little bargaining power. These ne- 
gotiation difficulties can make a firm internalize their production by engaging in FDI rather 
than signing contracts with local firms. 

Early analytical FDI theory based on the OLI principle suggested two distinct motiva- 



tions for FDI, horizontal and vertical FDI (Markusen 1984 Helpman 1984). Horizontal 
FDI (HFDI) is undertaken when firms want to access different markets. Instead of increas- 
ing their production at home and exporting produced goods, they invest in production in 
the foreign country. It is a way to serve a foreign market by partially duplicating the exist- 
ing production. Thus, a similar development level in both host and source country for the 
industry in which the firm is operating may increase HFDI. Reasons to engage in HFDI in 
the first place include the evasion of trade restrictions and reduction of trade costs which 
can be summarized as tariff-jumping FDI. This lowers the marginal costs for the firm and 
may lead to strategic advantage over national firms in the foreign market. Vertical FDI 
(VFDI), on the other hand, leverages lower factor prices in other economies. For example, a 
firm may have an interest to move unskilled labour-intensive activities to countries with low 
wages when the trade costs are relatively low as well. Usually firms with easily segmentable 
production processes tend to engage in VFDI. These two motivations have been unified into 



the knowledge-capital model of FDI (Markusen, 1997). Determinants suggested by these 
theories are trade costs, educational differences and market size. Note that aggregated FDI 
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flows are currently still dominated by horizontal FDI which is closely related to the fact that 
most FDI flows take place between developed countries. 

Developed countries also often engage in trade or investment agreements, which are designed 
to facilitate these international transactions. The question related to these agreements is 
whether they do have the positive impact they are supposed to have and in the case of 
trade agreements how they affect FDI. Liberalization of trade may have a negative effect 
when trade and FDI are substitutes, as in HFDI theory, but trading can lead to a higher 
GDP, which means a greater market size, and that is generally thought to have a positive 
impact on FDI. Overall, trade agreements are expected to positively influence FDI since a 
large portion of world trade is intra-firm. Another means of facilitation of both trade and 
investment are currency unions and these as such should not be expected to have a negative 
impact. 

Tax treaties and taxation in general, intuitively should have a negative impact on FDI. 
However, the empirical results are ambiguous. Taxation reduces the wealth of a firm and 
reduces the possibilities of investment but the laws vary strongly between different coun- 
tries. Firms may be motivated to evade taxes by strategic investment, which could increase 
investment and some countries may try to attract FDI with tax incentives. Tax treaties are 
a way for countries to level the playing field, resulting in higher overall tax. 

An approach, that is very successful and popular in empirical analysis of FDI, uses grav- 
ity equations. Gravity equations have been used for almost half a century to explain ex post 



effects of various country characteristics on bilateral trade flows (Bergstrand and Egger 



2007). In physics, we can observe that the gravitational force is poportional to the size of 



the physical body and inversely proportional to the square of the distance from the physical 
body. This leads to the idea that, as in physics, a greater market size of both host and source 
country positively influences trade flows while distance between the two economies has a neg- 
ative effect. Only recently, a theoretical foundation for the gravity model in trade flows has 



been established (Anderson and van Wincoop, 2003 Bergstrand and Egger, 2007). Since 
trade decisions are closely related to FDI decisions and FDI may be considered a substitute 
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for trade in HFDI theory, the gravity model is a very popular approach in examining FDI 



determinants (Navaretti and Venables, 2006). However, it seems as though FDI behavior 
is much more complicated to model than trade flows and there is no paper that identifies 



gravity variables as the sole determinants of FDI flows (Blonigen, 2005). 

One reason why FDI flows are more difficult to model than trade flows could be because 
the location of investment is a part of the decision. When national firms are looking to invest 
with the intention to export or import, the decision is influenced by potential trade partners 
and whether the conditions in the home country encourage further investment. The actual 
trade decision is not very sensitive to country characteristics, except those that directly affect 
profit. Those may be trade costs that affect the profit margin and market size which has an 
influence on the total demand of goods. National firms have to consider only the risks they 
take when investing in their home country, whereas multinational firms have to evaluate their 
disintegration costs which include all their risks in several different countries. They try to 
find the best country for their investment and many country characteristics have a potential 
influence on the risk an investor takes when making a long-term investment. These range 
from cumbersome regulations to economic or political conditions which in turn may lead to 
early withdrawal or even total loss of investment. While accurately measuring the risks is 
very difficult, estimation of the effects on FDI should be easier. Since individual analysis 
of all countries is too costly even for multinational firms, decisions are often influenced by 
indices that try to measure political, legal and economic conditions. The effect of indices or 
ratings on financial decisions can clearly be seen in the current debt crisis in Europe. For 
FDI, legal protection of assets, corruption and general quality of institutions but also existing 
or potential conflicts are only some of the factors measured by indices. If in a certain country, 
firms have to fear expropriation or difficulties in the repatriation of profits one should expect 
a lower FDI inflow for said country. Corruption increases cost of doing business and in some 
cases may be heavily punished (U.S. Foreign Corrupt Practices Act) while weak institutions 
may be pressured in face of a change of government. Internal and external conflicts may lead 
to unforeseen structural changes and falls in profitability. Internal conflicts can range from 
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civil disorder to civil war caused by religious/ethnic tensions or socioeconomic factors like 
unemployment or poverty. External conflicts can present themselves in foreign pressures, 
like restrictions on operations or trade/investment sanctions, to full-out war. 



4.2 Description of the Dataset 



We use the same set of data as Eicher et al. (2011) which is an unbalanced panel that covers 
the years 1988 - 2000. It contains 803 unique country pairs of 46 countries. 21 of these 
countries are not members of the OECD. The total number of observations is 14863 and 
64 percent of these observations indicate zero FDI inflow. In addition to the observed FDI 
inflows it contains data on 55 potential FDI determinants. The descriptive statistics of the 
log FDI flows and all 55 determinants can be found in Table 1. 



The dataset is based on Razin et al. (2008) which is the source for the gravity variables 
(market size, proxied by the real GDP, and distance), for factor endowments (development 
levels, proxied by real GDP per capita, and educational difference), and various other deter- 
minants - namely, productivity (real GDP per worker), GDP growth rate, financial risk and 
common language. Also, we will be using a variable indicating past FDI flows. Razin et al. 
obtained the FDI outflow data from the OECD International Direct Investment Database 
(OECD) which is noted in U.S. dollars and deflated it by the U.S. Consumer Price Index. 



Market potential has been constructed according to the definition given in Blonigen et al. 



(2007). 



Eicher and Henn (2011) provided the geographical and historical indicators, border and 



colony, but also a list of both multilateral and bilateral trade agreements, as well as currency 



union indicators. Another list of bilateral tax treaties has been obtained from Neumeyer and 



Spess (2005). 



The average effective corporate tax rates have been constructed using the definitions and 



information provided in Altshuler et al. (1988), Blonigen and Davies (2004) and the U.S. 
Treasury Corporate Tax Files. 

Institutional variables and other risk factors have been taken from the International Country 
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Risk Guides (ICRG, 2000), where the exact definition of each index can be found. Included 
in the dataset are indices of host and source country's corruption, bureaucratic efficiency 
and investment profile, a composite index concerned with contract viability/expropriation, 
repatriation of profits and payment delays, but also several political risk indices. These in- 
dices proxy the benefits of democratic accountability, government stability and a strong legal 
system, but also the risks of military government participation, internal/external conflicts, 
ethnic/religious tension and the socioeconomic profile, which includes unemployment and 
poverty. 

4.3 Results 

We ran several chains with 100,000 iterations each, to confirm convergence in the model and 
parameter space. As we can see in Figure 1, the running average number of determinants 
suggested by TBMA converges relatively quickly and is consistent with different starting 
points, for both the selection and the outcome equation. The average percentage of model 
jumps is just above 7%. Using TBMA we obtain 19 determinants for the selection equation 
and 28 for the outcome equation with an inclusion probability greather than 50%. This is 
more than the numbers suggested by HeckitBM A of 13 and 23 but still less than the 23 and 



35 determinants of the Heckit model (Eicher et al. 2011) 



Table 2 of (Eicher et al. , 2011) provides an overview of whether a determinant has been 
expected to have a positive, negative or no effect in the past. Table 3 shows TBMA esti- 
mates of posterior mean and standard deviation, as well as the inclusion probabilities of each 
determinant. The inclusion probability is defined as the ratio of the number of samples to 
the length of the chain after discarding the first 10,000 iterations as burn-in period. 

We find that TBMA gives support for roughly the same determinants as HeckitBMA 
when modeling the magnitude of FDI flows. As expected, the gravity variables host and 
source market size as well as distance have a decisive effect on FDI flows. Surprisingly, while 
distance has a negative and the source country's market size has a positive effect, we find 
that the market size of the host country influences FDI flows negatively. Of the geographical 
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chain length chain length 

(a) selection equation (b) outcome equation 

Figure 1: Running average numbers of determinants for selection and outcome equation of 
2 chains with length 100k. 

and historical determinants only a former colonial connection and common language exert 
a decisive positive effect on the amount of investment. As with HeckitBMA, the levels of 
development are found to be of significance while an effect of educational difference cannot 
be supported. A higher effective tax rate of both host and source country negatively influ- 
ences FDI. GDP growth in the host country, as well as productivity in the host country are 
very likely to have a positive impact, whereas source country GDP growth and productivity 
do not seem to be of importance. We also find the same peculiar behavior, that market 
potential does seem to have an effect although it is a negative one. The effect of regional 
trade agreements, namely APEC and LAIA are supported by both HeckitBMA and TBMA 
but only TBMA suggests a significant effect of the EEA. Both methods find a positive im- 
pact if the same currency of two countries is the U.S. dollar whereas the EURO seems to 
be of little importance, keeping in mind that the examined years range from 1988 to 2000. 
Risk proxies supported by both HeckitBMA and TBMA are corruption in host and source 
country, religious tension and internal conflicts in the host country and bureaucratic quality 
and ethnic tension in the source country. HeckitBMA additionally suggests the investment 
profile of the source country while TBMA shows an effect of the socioeconomic conditions 
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in host and source country and external conflicts in the host country. All risk proxies from 
the International Country Risk Guides are positively oriented, which means we are talking 
about lack of tensions and conflicts or quality of conditions. 

Looking at the selection equation this unison of HeckitBMA and TBMA seems to fade. 
Only five determinants receive high inclusion probabilities from both methods. These deter- 
minants are the levels of development and religious tension in the host country, the effective 
tax rates in both host and source country and a previous FDI relation. HeckitBMA fur- 
ther suggests the regional trade agreements APEC and EFTA, common language, financial 
risk in host and source country, religious tension in the source country as well as military 
government participation and ethnic tension in the host country as determinants with sig- 
nificant effect on the decision whether to enage in FDI. Additional to the five determinants 
supported by both methods, TBMA suggests that a former colonial relationship, distance, a 
common border, host market size, host and source development levels and a few risk indices 
are important for the decision of whether to engage in FDI. These risk indices proxy mainly 
source country risks, namely corruption, ethnic tension, the socioeconomic conditions, mili- 
tary government participation and external conflicts in the source country and only one host 
country risk, strength of the legal system and the obeyance thereof. 

These differences in the inclusion probabilities for the selection equation result from the 
different ways of estimation. HeckitBMA, as a two-stage estimation method, isolates the 
estimations for selection and outcome equation and models the influence of the selection on 
the outcome equation with the Inverse Mills Ratio. In TBMA these two estimations are 
simultaneous and feedback from the outcome to the selection equation is expected. 

5 Conclusions 

Dealing with model uncertainty by averaging parameter estimates according to posterior 
model probabilities has seen a rise in the last few years. MCMC methods are a very efficient 
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wajjjto approximate the posterior model probabilities and achieve Bayesian Model Averag- 
ing, as models with high posterior model probabilities are evaluated more frequently than 
others with low probability The length of the sum for the averaged parameter estimate 
is determined by the length of the Markov chain and not by the total number of models, 
which is particularly useful if we have a huge number of models where many of them have 
very low probability. While the necessary length of the chain is dependent on the number of 
high probability models, MCMC methods help finding these models without the necessity 
to calculate the posterior probability for every possible model. 

Within the MCMC framework, we found a way to implement Bayesian Model Averaging 
for the generalized Tobit model without using approximations, like the BIC, to achieve the 
right mixture of models according to the Bayes factors. Maximizing the likelihood and then 
penalizing for model complexity under the assumption of infinite observations, as BIC does, 
should be avoided especially when data is rare. Within a Gibbs sampler that explores the 
parameter space, it is possible to use the MC3 method with conditional Bayes factors. This 
way, estimates of the posterior model probabilities are unbiased for any number of observa- 
tions, whereas BIC may systematically penalize model complexity too strongly or too weakly 
when dealing with a small dataset. While the structure of the conditional likelihood for the 
conditional Bayes factor is similar in interpretation to the structure of the posterior model 
probability derived from the BIC, it is not necessary to use a flat prior on the model space 
for BMA. We can easily modify step 5 in the Tobit Bayesian Model Averaging algorithm 
(section 4.1) to include various prior beliefs about model probabilities, according to section 
3.4, equation (8). 

Since TBMA is fully justified within the Bayesian framework, we expect more accurate 
parameter estimates in comparison to other BMA results. Also, other hierarchical compo- 
nents, like random effects and time dependencies, can be included more confidently with a 
firm ground to rely on. TBMA easily accomodates additional components, as all it takes is 

lr The whole algorithm, as implemented in section 4.1, takes less than 1 hour of runtime for our dataset 
and a chain of length 100k on an Intel(R) Core(TM) i5-2400 CPU @ 3.1GHz, 8GB RAM, Windows 7 SP1 
64 Bit system using R x64 2.13.1 (no multicore support). 
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another additional Gibbs step, and using the combination of conditional Bayes factors and 
MC3 fits perfectly into the MCMC paradigm. 
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Appendix A 



Parameters for the conditional posterior distribution of Zi 
• For censored observations: 

Hi = wfi 
a? = l 



For uncensored observations: 
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Parameters for the conditional posterior distribution of 4>: 

Si = s + n Q 

s 1 = So + i 2 J2 & - w '^ 2 - 2 ^ E (* - w 'f)(v* - + E (v* - x '^ 2 

i\z{>0 i\zi>0 i\zi>0 



Appendix B 



The covariance structure of the linear regression system is given by the covariance matrix 
E, which is parameterized by 7 and (see section 2.1) and a priori independent from the 
choice of model E M. Restrictions are imposed on the space of the regression parameter 
vector if) = (0',/3')' according to the model My (see section 3) and the dependence is noted 
by ipy. The parameters for the conditional prior normal distribution of ip^ are mean V'o.ij 
and covariance matrix &o,ij- For the definition of y i and Xj, as well as the posterior mean 
ifti ij and covariance matrix ^l,^- see Appendix A. 



pr(D|^-,7,0) = j pr(D|My,^., 7 ,0) \k(w, s M,,)^:, 

oc(2tt)-^/ 2 |* ^r 1/2 
/ eX p I - \ 

J \ l k\z k > 
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+ J2 (yk-x k %y(y k -x k ^ i:j ) 

k\z k <0 
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I exp 
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Table 2: FDI Determinants and their Estimated Effects in the Past 
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Table 3: TBMA estimates for FDI determinants 
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